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MicroRNAs (miRNAs) derive from longer precursors with fold-back structures. While animal miRNA precursors have 
homogenous structures, plant precursors comprise a collection of fold-backs with variable size and shape. Here, we design 
an approach to systematically analyze miRNA processing intermediates and characterize the biogenesis of most of the 
evolutionary conserved miRNAs present in Arabidopsis thaliana. We found that plant miRNAs are processed by four 
mechanisms, depending on the sequential direction of the processing machinery and the number of cuts required to 
release the miRNA. Classification of the precursors according to their processing mechanism revealed specific structural 
determinants for each group. We found that the complexity of the miRNA processing pathways occurs in both ancient and 
evolutionary young sequences and that members of the same family can be processed in different ways. We observed that 
different structural determinants compete for the processing machinery and that alternative miRNAs can be generated 
from a single precursor. The results provide an explanation for the structural diversity of miRNA precursors in plants and 
new insights toward the understanding of the biogenesis of small RNAs. 



[Supplemental material is available for this article.] 

In multicellular organisms, microRNAs (miRNAs) are a class of 
small RNAs of —21 nt that originate from endogenous loci and 
regulate other target RNAs by base complementarity (Voinnet 
2009). MiRNAs are distinguished from other small RNAs by their 
unique biogenesis that involves the precise excision of the stem of 
a fold-back precursor (Meyers et al. 2008; Bologna et al. 2013). Al- 
though the current evidence indicates that miRNAs have emerged 
and specialized independently in animals and plants, their bio- 
genesis depends on the recognition of structural cues located in the 
miRNA precursors (Axtell et al. 2011; Cuperus et al. 2011). 

Most plant miRNAs are encoded as independent transcrip- 
tional units. They are transcribed by RNA polymerase II and then 
capped, spliced, and polyadenylated (Xie et al. 2005). Precursor 
processing occurs in the specialized D-bodies located in the plant 
nuclei (Fang and Spector 2007; Fujioka et al. 2007; Song et al. 
2007). In Arabidopsis thaliana and other plants, the core compo- 
nent of the miRNA processing machinery is the RNase III DICER- 
LIKE 1 (DCL1), which produces the cuts on the fold-back pre- 
cursors that release the mature miRNAs (Park et al. 2002; Reinhart 
et al. 2002; Kurihara and Watanabe 2004; Liu et al. 2012). 

In addition to DCL1, several proteins contribute to the pro- 
cessing of plant precursors. DCL1 interacts with the double-strand 
RNA (dsRNA) binding protein HYPONASTIC LEAVES 1 (HYL1) 
(Han et al. 2004; Vazquez et al. 2004). SERRATE and components of 
the CAP binding complex, which are required for splicing, also 
contribute to the biogenesis of miRNAs (Lobbes et al. 2006; Yang 
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et al. 2006; Gregory et al. 2008; Laubinger et al. 2008). Furthermore, 
mutations in CPL phosphatases (Manavella et al. 2012b), DAWDLE 
(Yu et al. 2008), TOUGH (Ren et al. 2012), SICKLE (Zhan et al. 
2012), and HASTY (Bollman et al. 2003), also compromise the 
biogenesis of miRNAs in plants. 

Plant miRNA precursors constitute a wide range of structures, 
and their fold-back lengths vary between 50 and 900 nt (Bologna 
et al. 2009; Cuperus et al. 2011). In contrast, animal miRNA pre- 
cursors have a typical structure comprising a stem of —three helical 
turns, a small terminal loop, and long single-stranded RNA (ssRNA) 
sequences flanking the fold-back structure (Han et al. 2006). The 
animal microprocessor, containing the RNase III Drosha and the 
dsRNA-binding protein DGCR8/Pasha, recognizes the ssRNA- 
dsRNA junction of the precursor and produces a first cleavage 
11 bp away, defining the base of the miRNA (Han et al. 2006; Kim 
et al. 2009). The second cleavage is then performed in the cyto- 
plasm by Dicer recognizing the free 3 ' end of the precursor stem 
and cutting 22 nt (—two helical turns) from this end (Saito et al. 
2005). 

Part of the miRNA precursors in plants have a stem of —15 nt 
below the miRNA/miRNA* followed by a large internal loop, which 
serves as a structural signal recognized by the processing machin- 
ery (Cuperus et al. 2010; Mateos et al. 2010; Song et al. 2010; 
Werner et al. 2010). However, this structural determinant is not 
found in all plant miRNA precursors (Mateos et al. 2010). Fur- 
thermore, the biogenesis of the evolutionarily conserved miR319 
and miR159 starts with a first cut next to the loop and continues 
with three additional cuts in a loop-to-base direction until the 
miRNAs are finally released (Addo-Quaye et al. 2009; Bologna et al. 
2009). Further plant precursors have been shown to release other 
small RNAs in addition to the miRNAs (Kurihara and Watanabe 
2004; Zhang et al. 2010), although the underlying processing 
mechanisms are unknown. 



23:1675-1689 ©2013, Published by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/13; www.genome.org 



Genome Research 1 675 

www.genome.org 



Bologna et al. 



Although previous work has characterized the biogenesis of 
a specific subset of miRNA precursors, a genome-wide view of the 
processing of plant miRNAs is still missing. Here, we developed 
and applied an approach to systematically analyze miRNA pro- 
cessing intermediates. We could determine the processing mech- 
anism of most of the evolutionarily conserved miRNAs in Arabi- 
dopsis thaliana. We found that the biogenesis of plant miRNAs is 
more complex than previously thought, and we identified four 
processing mechanisms. Furthermore, we observed that miRNAs 
belonging to the same family can be processed by different 
mechanisms. The complexity of processing pathways found in 
ancient miRNAs is mirrored in young recently evolved small RNAs, 
in which competing RNA determinants might release alternative 
small RNAs. The results obtained here might explain the wide 
range of sizes and shapes observed in the plant miRNA precursors. 

Results and Discussion 

Specific parallel amplification of RNA ends [SPARE] 

Previous approaches to identify processing intermediates of in- 
dividual precursors in plants have been based on a modified 5'- 



rapid amplification of cDNA ends (RACE) (German et al. 2008; 
Addo-Quaye et al. 2009; Bologna et al. 2009; Mateos et al. 2010; 
Song et al. 2010; Werner et al. 2010). We sought to modify this 
approach to systematically identify processing intermediates of 
plant miRNAs. To do this, we combined (1) a specific reverse tran- 
scription primed by a mixture of oligos matching miRNA pre- 
cursors, (2) a modified 5' RACE-PCR coupled to high- throughput 
deep-sequencing, and (3) bioinformatics tools. We called this ap- 
proach SPARE (Fig. 1 A), as it is a modification of previously described 
methods (e.g., PARE) to detect poly(A)+ mRNA derived fragments 
by deep-sequencing (Addo-Quaye et al. 2008; German et al. 2008). 

We designed specific oligos that hybridize to the 3' arm of 169 
miRNA precursors, 30 nt below the miRNA/miRNA* duplex (Sup- 
plemental Table 1). Of these sequences, 90 oligos were designed 
against miRNAs present in a broad range of species, at least across 
angiosperms (designated as "conserved miRNAs"), while the 
remaining 79 were prepared against evolutionarily younger miRNAs 
present in fewer species more related to Arabidopsis (designated as 
"young miRNAs") (Cuperus et al. 2011) (Supplemental Tablel). 

Briefly, after depleting rRNA, uncapped RNAs were ligated 
with an RNA adaptor. A reverse transcription is performed using a 
mixture of the precursor-specific oligos that also contain a generic 
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Figure 1 . Construction and analysis of a miRNA precursor intermediates library (SPARE). (A) Scheme illustrating the SPARE library construction used to map 
processing intermediates and the expected results depending on the processing direction of the miRNA precursors; (1) indicates the first cleavage reaction and 
(2) the second one. (B) Modified 5' RACE PCR results from wild-type and fieryl mutants on selected miRNA precursors. (C) Scheme of the procedure to analyze 
the SPARE data. (D) Frequency of detected cuts flanking the miRNA/miRNA* region (green) and other positions along the precursor (purple). 
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adaptor tail (Fig. 1A). Precursor intermediates are next amplified 
using oligos against the RNA adaptor and the common tail of 
precursor-specific oligos and subjected to deep-sequencing (Fig. 
1A; see Methods for details). For our studies, we used Arabidopsis 
plants defective in FIERY 1, which leads to reduced XRN activities, 
and therefore have an increased accumulation of miRNA process- 
ing intermediates (Gy et al. 2007). A pilot experiment designed 
against five precursors showed that the modified 5' RACE-PCR 
produced the same amplification products in wild-type and fieryl 
plants, although the detection of intermediates for some miRNAs 
was improved in fieryl (e.g., MIR156a and MIR171b) (Fig. IB). 
These results prompted us to use fieryl in our studies. 

Next, we applied the SPARE method to a mixture of samples of 
seedlings, leaves, and inflorescences of fieryl plants. To analyze the 
results, we arbitrarily defined a miRNA precursor as the sequence 
between the miRNA and the miRNA*, plus 30 additional bases 
below the miRNA/miRNA*. We obtained reads from 18 to 40 nt in 
length. As contamination with small RNAs have been shown as a 
problem in PARE or degradome libraries (Addo-Quaye et al. 2009), 
we only focused on sequences longer than 25 nt. These longer 
sequences also allowed us to assign the reads to specific members 
of the same family. Using these stringent conditions, we obtained 
more than 30,000 reads corresponding to miRNA processing in- 
termediates in Arabidopsis thaliana (Supplemental Table 2). 

A precursor was considered "detected" if more than three 
reads corresponded to its sequence. We identified RNA fragments 
corresponding to 129 precursors, 71 of them of conserved miRNAs 
and 58 young miRNAs (Fig. 1C; Supplemental Table 1). Precursors 
that were not detected might correspond to miRNAs expressed 
under specific growth conditions or cells not represented in the 
tissues we analyzed, but we cannot rule out that some sequences in 
miRBase do not correspond to bona fide miRNAs (Axtell 2008; 
Axtell and Bowman 2008; Fahlgren et al. 2010). 

Due to the relative position of the oligos, this method allows 
the detection of the first cleavage position only if the precursor 
is processed in a base-to-loop direction, or all the cleavage in- 
termediates if the biogenesis proceeds in a loop-to-base path- 
way (Fig. 1A). We analyzed the sequence of the processing 
intermediates and mapped the corresponding cuts along the sec- 
ondary structure of the precursors (Fig. 1C). We found that >75% of 
the sequence reads corresponded to the cleavages flanking the 
miRNA/miRNA* duplexes (proximal side and distal side, located 
adjacent to the lower stem and the upper stem, respectively) along 
the precursors of conserved miRNAs. This frequency further in- 
creased to 87% if a flexibility of one base is allowed flanking the 
miRNA/miRNA* ends (Fig. ID; Supplemental Table 3), which is in 
agreement with a certain degree of heterogeneity observed in the 
sequences of the small RNAs obtained by sequencing (Nakano 
et al. 2006; Lu et al. 2010; Jeong and Green 2012). 

Next, we applied the SPARE approach to wild-type plants 
(Supplemental Table 2). The results obtained were similar to those 
of fieryl, and most of the reads mapped to the flanks of the miRNA/ 
miRNA* duplexes (Supplemental Fig. 1B-E; Supplemental Table 3). 
However, we detected a higher number of reads for some miRNA 
precursors in fieryl, which is consistent with the previous findings 
showing that the miRNA precursor intermediates accumulate in 
this mutant (Gy et al. 2007). Altogether, the results show that our 
method detects precursor intermediates that are processed to give 
rise to miRNAs as detected by small RNA sequencing. In principle, 
additional cuts that do not match the flanking positions of the 
miRNA/miRNA* might correspond to misprocessing reactions, 
precursor decay, or productive processing reactions that start far- 



ther away from the miRNA/miRNA* such as those occurring during 
miR319 biogenesis (Addo-Quaye et al. 2009; Bologna et al. 2009). 

Identification of precursors processed in a base-to-loop 
direction and structural determinants characterization 

The design of the SPARE library determines that only the first cut is 
detected in precursors cleaved in a base-to-loop processing (Fig. 
1A). We then focused on those precursors with reads located only 
at the proximal side of the miRNA/miRNA* (Fig. 2A). We detected 
32 precursors of conserved miRNAs with reads representing the 
proximal end of the miRNA/miRNA* without detecting cuts in the 
upper part of the duplex (Fig. 1A; Table 1; Supplemental Table 3). 

Approximately 85% of the reads detected in these precursors 
matched to the exact proximal site of the miRNA/miRNA* (Fig. 
2A,B; Supplemental Fig. IB), which could be extended to —90% if 
we include in this analysis the two positions at each side of the 
proximal site (Fig. 2B, inset). Our result is consistent with the 
analysis of small RNA libraries that has shown that miRNA variants 
shifted one or two bases are also easy to detect (Nakano et al. 2006; 
Lu et al. 2010; Jeong and Green 2012) and further confirms that 
many of these variants are likely generated by certain variability 
during the precursor processing. Interestingly, slight differences in 
the sequence of the mature miRNAs might have relevant impli- 
cations in the selection of target RNAs (Palatnik et al. 2007; Jeong 
and Green 2012). 

Genetic analyses have shown that positioning of the initial 
DCL1 processing event in many precursors is dependent on a lower 
stem of —15 bp below the miRNA/miRNA* duplex followed by 
a large internal loop (Cuperus et al. 2010; Mateos et al. 2010; Song 
et al. 2010; Werner et al. 2010). We looked at the secondary struc- 
tures of the 32 precursors analyzed in this section and found that all 
of them have a clear lower stem (see below) (Fig. 2C), with the ex- 
ception of certain MIR166 family members (Supplemental Fig. 2). 
However, the miR166/miRl 66* duplexes are specifically recognized 
by AGO 10 (Zhu et al. 201 1), and therefore, it might be interesting 
to study whether other internal loops in the MIR166 precursors 
have also additional functions during the miRNA biogenesis. 

Previous structural determinants were identified by averaging 
the secondary structure of all plant precursors, so we thought that 
other features might be uncovered if we analyze the precursors 
experimentally validated to be processed in a base-to-loop di- 
rection. To do this, the proximal side of the miRNA/miRNA* was 
defined as +1, and we analyzed the secondary structure of each 
nucleotide from positions -25 to +40 (Fig. 2C). A lower stem of 15 
nt, as well as a structured region containing the miRNA/miRNA*, 
could be identified in both, the average of all conserved miRNAs 
and the experimentally validated base-to-loop precursors (Fig. 2C). 
However, the signal for structural determinants was clearer in the 
base-to-loop precursors group, which allowed us to search for ad- 
ditional features prevalent in these precursors. We observed that 
the bases immediately below the miRNA/miRNA* tend to be un- 
paired (positions -2 to -1), separating two double-stranded seg- 
ments, one harboring the lower stem and the other the miRNA/ 
miRNA* (Fig. 2C). Furthermore, positions -4 to -3 and the prox- 
imal three bases of the lower stem (positions -15 to -13) were 
paired in most cases, likely providing a clamp at the ends of the 
structured region of the lower stem. The transition from the lower 
stem to the single-stranded sequences below this region was 
sharper in the precursors validated as being processed by a 
base-to-loop mechanism than in the average of all conserved 
miRNAs (Fig. 2C). In turn, the structured region corresponding to 
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Figure 2. Identification and characterization of miRNA precursors processed in a base-to-loop direction. (A) Scheme showing the secondary structure of 
MIR168a, MIR172b, and MIR395b. The arrows indicate the positions and number of reads of the precursor cuts identified. Green arrows show the most 
abundant cleavage site detected, which also corresponds to the proximal site of the miRNA/miRNA*. Black arrows show other cleavage sites of at least 5% 
abundance of the total reads, while other minor cuts are shown in gray. A lower stem structured region of ~1 5 nt below the proximal cut is highlighted 
with a pink box. The miRNAs are indicated in red and the miRNAs* in blue. The inset on the right shows the typical cleavage pattern detected in the SPARE 
library for these precursors. (B) Distribution of cuts along the precursor sequence for specific miRNA families and all conserved miRNAs detected as being 
processed in a base-to-loop direction. The proximal side of the miRNA/miRNA* is defined as +1 . The inset shows the frequency of cuts at the proximal site of 
the miRNA/miRNA* and the two positions next to it. (C) Secondary structure of the precursors detected to be processed in a base-to-loop direction. The 
structures were obtained from mfold (Zuker2003; http://www.bioinfo.rpi.edu/applications/mfold/), and the matches in each position were considered as 
0, while bulges and mismatches were considered as 1 . The secondary structure considering all conserved miRNAs is indicated as a purple line. 



the miRNA/miRNA* tends to accumulate unpaired bases at the 
central part (9 to 11), and certain positions (6, 17, and 21) (Fig. 2C). 
Overall, our results show that the precursors processed in a base-to- 
loop direction are more uniform than previously thought and that 
at least some of the precursors not detected as base-to-loop likely 
have other specific RNA determinants. 



Sequential base-to-loop processing 

The MIR169 family has 14 members, being the largest MIRNA 
family of Arabidopsis thaliana. The miRNAs of this family could be 
classified in four groups according to their mature sequences (Fig. 
3 A). All these small RNA sequences are detected in vivo, with 
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Table 1. Processing mechanisms of Arabidopsis MIRNAs 

MIRNA Structured regions Mechanism 



Conserved MIRNAs 



MIR156a 


Upper stem 


MIR156b 


Upper stem 


MIR156C 


Upper stem 


MIR156d 


Upper stem 


MIR156h 


Upper stem 


MIR159b 


Upper stem 


MIRHOa 


Upper stem 


MIR160b 


Upper stem 


MIR160C 


Upper stem 


MIR162b 


Upper and lower stem 


MIR164b 


Lower stem 


MIR164c 


Lower and upper stem 


MIR165a 


Lower stem 


MIR165b 


Lower stem 


MIR166a 


Unclear 


MIR166b 


Upper stem 


MIR166e 


Unclear 


MIR166f 


Upper stem 


MIR167a 


Lower stem 


MIR167b 


Lower stem 


MIR167d 


Lower stem 


MIR168a 


Lower stem 


MIR168b 


Lower stem 


MIR169a 


Lower stem 


MIR169b 


Lower stem 


MIR169d 


Lower stem 


MIR169e 


Lower stem 


MIR169f 


Lower stem 


MIR169g 


Lower stem 


MIR169j 


Lower and upper stem 


MIR169I 


Lower and upper stem 


MIR169m 


Lower and upper stem 


MIR169n 


Lower and upper stem 


MIR170 


Lower stem 


MIRWla 


Lower stem 


MIRWlb 


Upper stem 


MIRUlc 


Upper stem 


MIR172a 


Lower stem 


MIR172b 


Lower stem 


MIR172d 


Lower stem 


MIR172e 


Lower stem 


MIR319a 


Upper stem 


MIR319b 


Upper stem 


MIR319C 


Upper stem 


MIR390a 


Lower stem 


MIR390b 


Lower stem 


MIR391 


Lower stem 


MIR393a 


Lower stem 


MIR393b 


Lower stem 


MIR394a 


Lower and upper stem 


MIR394b 


Lower and upper stem 


MIR395a 


Lower stem 


MIR395b 


Lower stem 


MIR395C 


Lower stem 


MIR396a 


Lower stem 


MIR396b 


Lower stem 


MIR397a 


Lower stem 


MIR398b 


Lower stem 


MIR398C 


Lower stem 


MIR399b 


Lower stem 


MIR399C 


Lower stem 


MIR408 


Lower stem 


MIR827 


Lower stem 



Short loop-to-base 
Short loop-to-base 
Short loop-to-base 
Short loop-to-base 
Short loop-to-base 
Sequential loop-to-base 
Short loop-to-base 
Short loop-to-base 
Short loop-to-base 
Probable short loop-to-base 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Unclear 
Unclear 
Unclear 
Unclear 

Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Sequential base-to-loop 
Sequential base-to-loop 
Sequential base-to-loop 
Sequential base-to-loop 
Sequential base-to-loop 
Sequential base-to-loop 
Sequential base-to-loop 
Sequential base-to-loop 
Sequential base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short loop-to-base 
Short loop-to-base 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Sequential loop-to-base 
Sequential loop-to-base 
Sequential loop-to-base 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Sequential base-to-loop 
Sequential base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Unclear 

Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 
Short base-to-loop 



Young MIRNAs 
MIR158a 

MIR161 (miR161.1) 
MIR161 (miR161.2) 



Lower stem 

Lower and upper stem 

Lower and upper stem 



Short base-to-loop 
Short base-to-loop 
Short base-to-loop 



(continued) 



miR169a being the most abundant one 
(Fig. 3B). While we identified MIR169a 
as a miRNA precursor processed in a 
base-to-loop direction displaying a clear 
structured lower stem (Fig. 3C; Table 1), 
we did not detect any cuts flanking the 
miRNA/miRNA* in the rest of the pre- 
cursors of this family (Fig. 3C; Supple- 
mental Figs. IE, 3A). 

A more detailed inspection of the 
data showed that the cleavage sites in 
the precursors of MIR 169b/d/e/f/g were 
located —21 nt below the proximal 
side of the miRNA/miRNA* duplex 
(Fig. 3C; Supplemental Fig. 3 A). This is 
the expected distance between two 
DCL1 cleavages, suggesting these pre- 
cursors are processed by more than 
two cuts of the enzyme. The fact that 
we detected only one main cut in- 
dicates that the processing direction is 
base-to-loop (Fig. 1A; Fig. 3C). In- 
spection of the predicted secondary 
structure of these precursors revealed 
the existence of an internal loop fol- 
lowed by an ~ 15-nt lower stem below 
the detected cuts. These results sug- 
gest that the miRNA processing com- 
plex recognizes the cleavage site of 
the first cut in a similar way to the 
short precursors processed by a base- 
to-loop mechanism, by counting a 
15-nt stem region above the internal 
loop. However, in contrast to the 
canonical base-to-loop precursors, 
these MIR169 precursors suffered 
three dicing events, and the miRNA 
is released by the second and third 
cuts (Fig. 3C). In agreement with this 
possibility, small RNAs have been 
cloned that correspond to the frag- 
ments cleaved by the first and sec- 
ond cleavage sites (Fig. 3C; Zhang 
et al. 2010). The secondary structure 
of MIR394 precursors and their cleav- 
age pattern suggest that the MIR394 
family is also processed by a long base- 
to-loop mechanism (Supplemental Fig. 
3B; Table 1). 

Interestingly, there is a third set 
of MIR169 precursors (MIR169h-n) that 
contains a lower stem but also a struc- 
tured upper stem (Table 1; Fig. 3C; 
Supplemental Fig. 3 A). These members 
of the MIR169 family are detected at 
low frequency in small RNA databases 
and in our library (Fig. 3B,C; Supple- 
mental Fig. 3 A). Still, we found most 
of the cuts located —21 nt below the 
miRNA/miRNA* (Fig. 3C; Supplemen- 
tal Fig. 3A), suggesting that they are also 
processed by a sequential base-to-loop 
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Table 1. Continued 
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MIR163 


Lower and upper stem 


Sequential base-to-loop 


MIR400 


Lower and upper stem 


Short loop-to-base 


MIR403 


Lower stem 


Short base-to-loop 


MIR447a 


Lower stem 


Sequential base-to-loop 


MIR447b 


Lower stem 


Sequential base-to-loop 


MIR472 


Lower stem 


Sequential base-to-loop 


MIR771 


Lower stem 


Short base-to-loop 


MIR779 


Lower and upper stem 


Sequential loop-to-base 


MIR824 


Lower stem 


Short base-to-loop 


MIR825 


Lower and upper stem 


Short loop-to-base 


MIR862 


Lower and upper stem 


Sequential base-to-loop 


MIR864 


Lower stem 


Short base-to-loop 



mechanism, with the possibility that the cuts continue along the 
structured upper stem after the miRNAs are released. Altogether, these 
results show the existence of another miRNA precursor processing 
mechanism, which is a modification of the two-step base-to-loop 
pathway, and requires several cleavage reactions. Furthermore, it 
might be interesting to study whether MIR169a and MIR169b-h, 
which have a different biogenesis, also have distinct properties as 
small RNAs. 



Identification of short precursors processed in a loop-to-base 
direction 

The precursors described above have one major cleavage site 
detected in our library, which is expected for the base-to-loop 
processing mechanism. In principle, precursors processed in a 
loop-to-base direction have all their cleavage sites detected by our 
approach. Therefore, to find miRNAs whose biogenesis starts from 
the loop, we searched for precursors with cuts at both sides of the 
miRNA/miRNA* (Figs. 1A, 4A). We found 16 precursors of con- 
served miRNAs with detectable cuts (>5%) in the distal side of the 
miRNA/miRNA* (Table 1; Fig. 4A,B; Supplemental Fig. 1C). With 
the exception of two miRNAs (MIR396a and MIR162b), these pre- 
cursors did not have any obvious structured region below the 
miRNA (Fig. 4C). Among these miRNAs, we also detected members 
of the MIR319/MIR1S9 families that have long upper stems and are 
processed by four DCL1 cuts in a loop-to-base direction (Fig. 4B; 
Supplemental Fig. ID; Addo-Quaye et al. 2009; Bologna et al. 
2009), confirming that our method allows the discrimination of 
the different processing directions. 

We were interested in finding new miRNAs processed in a 
loop-to-base direction. Twelve precursors were identified that 
harbor a short upper stem above the miRNA/miRNA*, therefore 
differing from the \ongMIR319 and MZR 2 59 precursors. These newly 
identified precursors were processed by only two cuts, instead of the 
four observed in the case of MIR319 and MIR159. Among these 
miRNAs, we found the members of the MIR156 and MIR1 60 families 
(Table 1; Fig. 4A,B), known to regulate SPL and ARF transcription 
factors, respectively (for review, see Jones-Rhoades et al. 2006; Axtell 
and Bowman 2008). We also observed that the cuts of members of 
the MIR1S6 family always release miRNA/miRNA* duplexes with 
asymmetric bulges and containing miRNAs* of —22 nt and miRNAs 
of —20 nt (Fig. 4 A), in good agreement with previous results char- 
acterizing these small RNAs and their ability to generate secondary 
siRNA (Manavella et al. 2012c). 

The short precursors processed in a loop-to-base direction 
have a structured terminal region (Fig. 4C) that has a homoge- 



neous size of —42 nt that includes a 
short loop, in contrast to the same 
region in the precursors processed in a 
base-to-loop direction, which is quite 
variable (Fig. 4D). Previous analysis 
of the MIR319 precursors revealed that 
they have a conserved region of the 
stem above the miRNA/miRNA*, which 
can generate other small RNAs (Axtell 
et al. 2007) and is necessary for the 
biogenesis of the miR319 (Bologna 
et al. 2009). We did not observe any 
sequence conservation in the short 
loop-to-base precursors outside the 
miRNA/miRNA*. These observations 
suggest that there are differences in 
the MIR319/MIR159 precursors and those identified here, al- 
though the data show that plant miRNAs are frequently gener- 
ated by a loop-to-base processing mechanism. 

Mixed processing of members of the MIR170I MIR171 family 

In general, we found that different members of a miRNA family 
share their biogenesis pathway (Table 1; Figs. 2B-4B). This obser- 
vation is not surprising since miRNA families are thought to ex- 
pand by duplication events of an ancestral miRNA gene (Allen 
et al. 2004; Maher et al. 2006). However, we noticed that the pro- 
cessing signatures of the members of certain families could vary 
from each other. In addition to the MIR 169 family (Fig. 3C), we 
found that the MIR170/MIR171 precursors also have different 
processing signatures. A detailed inspection of the processing in- 
termediates for MIR170 and MIR171a revealed cuts only at the 
lower end of the miRNA/miRNA*, indicative of a base-to-loop 
biogenesis pathway (Fig. 5; Supplemental Fig. 4A) and in agree- 
ment with previous data on MIR171a (Song et al. 2010). In con- 
trast, we found that MIR1 7 lb and MIR 1 71c precursors had cleavage 
sites at both ends of the miRNA/miRNA* duplex, which is com- 
patible with a loop-to-base processing mechanism (Fig. 5; Sup- 
plemental Fig. 4A). That individual miR170/171 miRNAs have 
potentially different biogenesis mechanisms prompted us to study 
this family in more detail. 

The MIR170/MIR171 family is composed of four miRNAs in 
Arabidopsis thaliana and is known to regulate Scarecrow-like tran- 
scription factors. The family members differ in the nucleotide se- 
quences of their mature miRNA (Fig. 5 A), and therefore, their ex- 
pression in plants can be validated by deep-sequencing small RNAs 
(Fig. 5B). We overexpressed different family members in Arabi- 
dopsis thaliana. Plants harboring a 35S:MIR171a transgene had 
darker and fewer leaves and branches than wild type (Fig. 5C-E; 
Song et al. 2010). To quantify the phenotypes caused by miR171 
overexpression, we counted the number of cauline leaves in at least 
50 independent primary transgenic plants, as has been done pre- 
viously (Fig. 5E; Song et al. 2010). Overexpression of MIR171b 
caused a similar phenotype to MIR171a, as expected by their sim- 
ilar sequences (Fig. 5A,C,E). 

Next, we analyzed the structural determinants required for 
the processing of MIR171a and MIR171b precursors. We observed 
that MIR171a has a 15-nt lower stem, while MIR171b has a struc- 
tured upper stem, in agreement with their processing direction 
starting from the base and the loop, respectively (Fig. 5C; Supple- 
mental Fig. 4A). First, we deleted the region below the miRNA/ 
miRNA* in both precursors (Fig. 5D). We found that the biogenesis 
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Figure 3. Single and sequential processing of MIR 1 69 family members. (A) Sequences of miRI 69 small RNAs. (B) Relative abundance of the small RNAs 
determined by deep-sequencing small RNAs from wild-type plants. (C) Scheme showing precursors of selected MIRI 69 family members. The arrows 
indicate the positions and number of reads corresponding to the miRNA precursor cleavage sites identified. The most abundant cut is indicated by a green 
arrow. Gray arrows show other less abundant cleavage sites. A lower stem structured region of ~1 5 nt below the first cut is highlighted with a pink box. 
Additional small RNAs detected by deep-sequencing small RNAs are indicated in yellow. The miRNAs are indicated in red and the miRNAs* in blue. 



of miRI 71a was completely abolished in this mutant precursor, 
and no obvious phenotype was detected among 50 independent 
transgenic plants (Fig. 5E). In contrast, the biogenesis of miRI 7 lb 
was unaffected after removing the lower region (Fig. 5D-F) ; and the 
resulting transgenic plants had a typical miRI 71 overexpression 
phenotype (Fig. 5E). Then, we eliminated most of the region above 
the miRNA/miRNA* in both precursors. In this case, miRI 71a over- 



expression was unaffected by the modification, while the biogenesis 
of miRI 7 lb was completely impaired in all the transgenic plants 
analyzed (Fig. 5D-F). Determination of the MIRI 71 precursors by RT- 
qPCR showed that they were all expressed (Supplemental Fig. 4B), 
confirming that the processing of the mutant precursors was affected. 

These results experimentally link the processing direction 
found in the SPARE library, the structural features in the precursors' 



Genome Research 1 681 



www.genome.org 



Bologna et al. 



B 



MIR156a 



5'AA U AC- AAC AA- A A UU UA 

GAG UGGG AAGAGA GCA GAA CUGACAG AAGAG AGUGAGCAC CAA AGGCAA UGCA U 
UUC GUCC UUCUCU CGO CUU GACUGUC UUCUC UCACUCGUG GUU UUCGUU ACGU C 

3'C- U GGU AGU GGC AUG C CUC C- UA 



t 



MIR156C 



5 ' GGGACAAAUUUUAAGAGAAAC AUA A A CUU GU 

GC GAA CUGACAGA AGAG AGUGAGCAC CAA AGGCA UGCAU U 
CG CUU GACUGUCU UCUC UCACUCGUG GUU UUCGU ACGUA \ 
3 ' GCACUGGCCCUGGCUUAGCCU -GC A AG C CUC -UU GC 



t 



t 



MIR156d 



3 13 

26 



5 ' GAUAAAUAUAU G AAA — AG AGU A 

GUU AUGGGGGA GAAGU UGACAGAA GAGAGUGAGCAC CAAAGG GA UGUAUAAA \ 

UAA UACUUCCU CUUCA ACUGUUUU CUCUCACUCGUG GUUUUC UU AUAUGUUU G 
3' CUUAUUCCUUC G C G GGU U 



Detection of both 
proximal and distal cuts 



3 Pi 3 2 

51 46 



t 




miRNA/ 
miRNA* 



distal 
cut 

proximal 
cut 
3' 



16 conserved MIRNAs 



MIR160a 

5'-- U- 



- U- A UAUAUAU C CU A U CC AG 

CA AUGUAG UGUAUAUGUAU GUAUGC UGGCUCC GUAUGCCAU UGC GAG CAUCG \ 

GU UGUAUU AUAUAUACAUA UAUACG ACCGAGG UAUGCGGUA GUG CUC GUAGC U 

3'UG UU A CUCC AG G C CA UA 



MIR319a 



227 



t 

100 



5'GAUGUUUUA- UAUGU- A U C UCGUGAU AG- U AC C AA C GCCAAAA 

GGAAUAUA AGAG GAGCUUCCUU AGUCCA UCA AGG AUGAUUCAAUU CU CCG UCAUUCAU CA UA CGAGUC U 

CCUUAUGU UCUC CUCGAGGGAA UCAGGU AGU UCU UACUAGGUUAA GA GGC AGUAAGUA GU AU GCUCAG U 

3'GUUCUUUUAA UUUCUC C GUU CUUAGU- ACA U GU A AA U AUCAAAC 



|0-5 



CD 
03 



% o. 



MIR1 56 family 1 
i loop-to-base average 
i MIR159/MIR319 family 




A 



-25 -20 -15 -10 



10 15 20 25 30 35 
Precursor length 



40 



45 



50 55 



60 



65 70 



(Unpaired) 1 



(Paired) 0 



Lower stem 



miRNA/miRNA* 



Loop 



lllilililll.lilllllllll 



llli. ■! ■ ■ f ■ ■ ■ ■ 



III. Ill 



-25-23-21 -19-17-15-13 -11 -9 -7 -5 -3 -1 1 3 5 7 9 11 13 15 17 19 21 23 25 27 29 31 33 35 37 39 



v 
r 



Precursor length 



300- 
S150 

<D 
N 

100- 

q. 
o 

-3 50- 



Base- Short 
to- loop- 
loop to-base 



Figure 4. Loop-to-base processing of plant miRNAs. (A) Scheme showing the secondary structure of MIR 1 56a/c/d, MIR160a, and MIR3 1 9a. The arrows 
indicate the positions and number of reads corresponding to the miRNA precursor cleavage sites identified. Green arrows show the most abundant 
cleavage site detected, which also match to the proximal and distal sides of the miRNA/miRNA* duplex of MIR156 and MIR1 60 and four DCL1 cuts of 
MIR3 1 9. Black arrows show other cleavage sites of at least 5% abundance of the total reads, while other minor cuts are shown in gray. A gray box highlights 
the structured upper stem of MIR 1 56 and MIR160. The miRNAs are indicated in red and the miRNAs* in blue. (B) Distribution of cuts along the precursor 
sequences of the average of all conserved miRNAs detected as loop-to-base and individual miRNA families. The proximal side of the miRNA/miRNA* was 
defined as +1 . Note that MIR159/3 1 9 precursors have two additional cuts. (C) Secondary structure of the precursors detected to be processed in a loop-to- 
base direction. The structures were obtained from mfold, and the matches in each position were considered as 0, while bulges and mismatches were 
considered as 1 . (D) Size of the terminal region in base-to-loop (light blue) and short loop-to-base (yellow) precursors. 



secondary structure, and the RNA domains required for the bio- 
genesis of the miRNAs. While MIR171a is processed in a base-to-loop 
direction and harbors a lower stem necessary for the biogenesis of the 
miRNA, MIR171b is processed in a loop-to-base direction and con- 



tains a structured upper stem necessary for the biogenesis of the 
miRNA. Analysis of the predicted secondary structure of MIR171 
from other plant species such as rice (Supplemental Fig. 4C) 
revealed that some of them have structural features similar to 
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Figure 5. Mixed processing of MIRI 70/1 71 family members. (A) Sequences of miRI 70/1 71 miRNAs. (B) Relative abundance of the small RNAs by deep- 
sequencing small RNAs from wild-type plants. (C) Scheme showing the secondary structure and cleavage sites detected in the family members. Structured 
regions in the lower stem (MIRI 70, MIRI 71a) and upper stem (MIRI 71 b/c) are highlighted with pink and gray boxes, respectively. The most abundant 
cleavage sites detected, which also release the miRNA/miRNA*, are indicated by green arrows. Gray arrows show other cleavage sites that are less 
abundant. The miRNAs are indicated in red and the miRNAs* in blue. (D) Scheme of MIRI 71a/b mutant precursors. (£) Phenotypes of transgenic Ara- 
bidopsis plants transformed with wild-type and mutant MIRI 71a/b precursors under the control of the 35S promoter. A representative transgenic plant 
overexpressing each construct is shown on the left. The panels show the cauline leaves' number distribution observed in at least 50 independent transgenic 
plants overexpressing each construct. (F) Small RNA blots showing the accumulation of miRI 71 in transgenic Arabidopsis seedlings expressing the different 
mutant precursors. (UR) Upper region; (LR) lower region. Each sample is a pool of at least 20 independent transgenic plants. 
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MIR170/MIR171a, while others to MIR171b/c, suggesting that the 
mixed processing of the MIR171 family members is widespread in 
plants. Interestingly, miRl 70/1 71a and miR171b/c are offset by 
three bases (Fig. 5 A), and miRl 71a* can be incorporated into AGO 1 
complexes and regulate another target SUVH8 (Manavella et al. 
2012a); it is tempting to speculate that perhaps this shift in their 
sequences appeared as a consequence of the diversification of the 
processing pathways of the family. 

Processing of young miRNA precursors 

We next looked at the biogenesis of young miRNAs that are present 
in Arabidopsis and related species. Young miRNAs are usually ex- 
pressed at low levels and, therefore, are difficult to detect (Rajagopalan 
et al. 2006; Fahlgren et al. 2007, 2010; Addo-Quaye et al. 2008). We 
found —10,000 reads corresponding to processing intermediates 
of 58 different young miRNAs (Table 1; Fig. 6; Supplemental Table 
3). We observed that MIR158a and MIR824 precursors had de- 
tectable cuts in the proximal side of the miRNA/miRNA* and a 
~15-nt lower stem, consistent with a base-to-loop processing 
mechanism (Fig. 6A,B). MIR163 was also processed in a base-to- 
loop direction; however, the miRNA was released after the second 
and third cuts (Fig. 6C), in agreement with previous studies 
(Kurihara and Watanabe 2004). The size of the terminal region of 
these precursors is fairly different; MIR158a has a length of 22 nt, 
and MIR824 and MIR163 have 569 nt and 254 nt, respectively, 
confirming that precursors processed by a base-to-loop mecha- 
nism have variable loop sizes (Figs. 4D, 6A,C). Although no in- 
trons have been reported to occur in the loops of these precursors 
(Kurihara and Watanabe 2004; Szarzynska et al. 2009), we cannot 
rule out that large loops could be reduced by a splicing reaction, as 
has been shown to occur in rice nat-miRNAs (Lu et al. 2008). 

We found that MIR400 and MIR779 reads were flanking 
the miRNA/miRNA* consistently with a loop-to-base processing 



direction (Fig. 6D,E). Our data also showed that MIR779 is pro- 
cessed by several cuts. We think that this processing activity on the 
miRNA precursors is determined by its secondary structure, which 
in the case of MIR779, MIR319, and MIR159 consists of an imper- 
fect long dsRNA (Bologna et al. 2009). While MIR400 has evolved 
from the inverted duplication of their targets encoding PPR pro- 
teins (Allen et al. 2004; Rajagopalan et al. 2006; Fahlgren et al. 
2007), MIR779 seems to have evolved from random sequences 
(Felippes et al. 2008). In both cases, however, the precursors con- 
tain a structured terminal loop of —40 nt. Therefore, the loop-to- 
base processing mechanism can be responsible for the biogenesis 
of small RNAs with a different evolutionary origin. Altogether, our 
results show that the different mechanisms and RNA structural 
determinants recognized during the processing of ancient miRNAs 
can also be used during the biogenesis of miRNAs that evolved 
recently. 

Partial cleavage and misprocessing of plant precursors 

The design of the SPARE library determines that the detected DCL1 
cuts correspond to the 3' arm of the miRNA precursors (Fig. 1A). 
Still, a proportion of the reads are mapped to the 5 ' arm of the 
precursors. These cuts might correspond to other pathways com- 
peting with the biogenesis of miRNAs, such as precursor decay. 
However, we also found cuts on the 5' arm at the expected DCL1 
cleavage sites of several miRNA precursors such as MIR171a/b 
(Fig. 5C), MIR393b (Fig. 7A), and MIR166b (Fig. 7B). 

Detection of cuts on the 5 ' arm at the flanks of the miRNA/ 
miRNA* implicate the existence of intermediates containing a ter- 
minal loop with the miRNA or miRNA* as a single-stranded RNA 
extension (Fig. 7A,B, see scheme on the right). To confirm these 
results and study the existence of these partial intermediates in 
more detail, we turned to another approach based on self-ligation 
of the RNA followed by an RT-PCR, which detects both ends of the 
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Figure 6. Processing of young miRNAs. Scheme showing the precursors of MIR158a (A), MIR824 (B), MIR163 (C), MIR400 (D), and MIR779 (£). The 
arrows indicate the positions and number of reads of the precursor cuts identified. Green arrows show the most abundant cleavage site detected, which 
also matches to the proximal and distal sides of the miRNA/miRNA*. Gray arrows show other cleavage sites that are less abundant. Structured regions in 
the lower stem (MIR158a, MIR824, and MIR163) and the upper stem (MIR400 and MIR779) are highlighted with pink and gray boxes, respectively. The 
miRNAs are indicated in red and the miRNAs* in blue. 
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Figure 7. Partial cleavage and misprocessing of plant precursors. (A,B) Processing intermediates of MIR393b(A) and MIR166b(B) detected by the SPARE 
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RNA molecule (Fig. 7C; Basyuk et al. 2003). We applied this ap- 
proach to plants overexpressing a MIR172a precursor whose pro- 
cessing has been studied in detail (Mateos et al. 2010; Werner et al. 
2010), using oligos located in the loop region (Fig. 7D). 

Using this technique, we detected the terminal loop of MIR172a 
as the most common intermediate (Fig. 7E). Several clones were also 
detected in which the miRNA* has been released from the precursor, 
while the miRNA continued being part of it (Fig. 7E). These results are 
consistent with the partial DCL1 cleavage found in our library. RNase 
III enzymes, such as DCL1, contain one processing center with two 
RNA cleavage sites that generate the products with 2-nt 3' overhangs 
(Zhang et al. 2004). The functional implications of this partial 
cleavage remain to be elucidated; however, similar intermediates 
produced by partial cleavage have also been detected in animal cells 
(Bracken et al. 2011; Gurtan et al. 2012). 

We also detected cuts in the central region of miR172/miR172*. 
Analysis of these intermediates showed that they have 2-nt 3' over- 



hangs, indicating that they are likely the result of the misprocessing 
of a MIR172a precursor by RNase type III enzymes (Fig. 7E). We then 
looked at the secondary structure of the precursor and found that 
there is a 3-nt bulge ~ 15 nt below one of these cuts, suggesting that 
the cleavage might be the result of a misprocessing by DCL1 
(Fig. 7D,E). Previous work has already shown that DCL1 can 
guide the cleavage of additional sequences that contain a bulge 
followed by a 15-nt stem (Songetal. 2010); however, in this case, 
this additional cut would be a competing reaction inactivating 
the miRNA. 

Multiple RNA recognition patterns by the plant miRNA 
processing machinery 

That our library often detected cuts not involved or required for the 
release of the small RNAs (Figs. 2-6) and that the cycle RT-PCR on 
MIR172a confirmed that some of these cuts contained the 2-nt 3' 
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overhangs (Fig. 7E) prompted us to study 
the possibility of misprocessing in vivo of 
miRNA precursors in more detail. MIR825 
is a young MIRNA that seems to be pro- 
cessed by a short loop-to-base mechanism 
(Fig. 8A). However, we also observed the 
abundant reads for this precursor corre- 
spond to a cut inside the miRNA/miRNA* 
(Fig. 8A, see orange arrow). This addi- 
tional cleavage site is located above a 15- 
nt lower stem, suggesting the processing 
machinery can recognize this precursor 
also from a base-to-loop direction (Fig. 8A). 
An alternative small RNA from the other 
strand of the precursor can also be detected 
by deep-sequencing small RNAs, albeit to 
a lower level than miR825 (Fig. 8A). The 
cleavage pattern and small RNA data 
therefore suggests that MIR825 is being 
processed via a dual mechanism, one 
starting from the loop and another from 
the base, which in turn results in the 
generation of the two different miRNAs. 
Similarly, miR161.1 and miR161.2 are 
also overlapping miRNAs generated from 
the same precursor, which likely result 
from the recognition of alternative struc- 
tural determinants at the base of the 
precursor (Supplemental Fig. 5 A). Fur- 
thermore, we found that MIR472 is pro- 
cessed through a sequential base-to-loop 
mechanism, according to the reads lo- 
cated 21 nt below miRNA/miRNA* and 
the presence of a 15-nt lower stem (Supple- 
mental Fig. 5B). However, another abundant 
cleavage site inside the miRNA/miRNA* re- 
gion was also detected, which is likely the 
consequence of the recognition of a second 
internal loop (Supplemental Fig. 5B). Non- 
productive cuts consistent with the rec- 
ognition of alternative structural de- 
terminants were also found in conserved 
miRNA precursors, such as MIR 1 66 f (Sup- 
plemental Fig. 5C). 

To study the recognition of alterna- 
tive structure determinants alongside one 
precursor, we focused on the MIR319a 
precursor, which is processed by four 
DCL1 cuts starting below the loop (Fig. 
8B; Addo-Quaye et al. 2009; Bologna et al. 
2009). We generated different mutants of 
this precursor and expressed them in 
Arabidopsis thaliana using the 35S promoter 
(Fig. 8B,C). As we previously reported, de- 
leting the terminal region of this precursor 
completely abolished the biogenesis of this 
miRNA (short MIR319) (Fig. 8B-D; Bologna 
et al. 2009). Next, we mapped the pro- 
cessing intermediates of this mutant pre- 
cursor by 5' RACE PCR and found that the 
cuts were mainly located in the miRNA/ 
miRNA* region (Fig. 8B; Bologna et al. 
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2009). The positions of these cuts were above an internal bulge 
followed by a 15- to 16-nt stem, suggesting that the miRNA pro- 
cessing machinery is recognizing an alternative structural de- 
terminant in the absence of a terminal region (Fig. 8B, pink box). 

To test this possibility, we generated two additional derivatives 
of the MIR319a precursor. We first generated a short MIR319 pre- 
cursor with a lower stem of —15 nt, below the miRNA (short 
MIR319+LS) (Fig. 8B). Next, we further extended the lower stem 
below the large internal loop (short MIR319+LS2) (Fig. 8B). These 
two precursors partially recovered the activity of MIR3 19a in vivo 
(Fig. 8C) and expressed a functional miRNA (Fig. 8D). The precursor 
with the bulge in the lower stem (short MIR319+LS2) seemed to be 
processed with a slightly higher efficiency (Fig. 8C,D). Mapping of the 
processing intermediates of these precursors revealed that the cuts 
were mainly located below the miRNA/miRNA* which is consistent 
with a switch to a base-to-loop processing mechanism. These results 
confirm the existence of different competing RNA domains in the 
same molecule that can be recognized by the miRNA processing 
machinery with the stronger of them guiding the biogenesis path- 
way although in some cases one precursor might generate two dif- 
ferent small RNAs from the recognition of alternative determinants. 

Conclusions 

We have developed an approach to systematically analyze miRNA 
processing intermediates in Arabidopsis thaliana that could be 
easily applied to other experimental systems. Using this strategy 
we characterized four processing pathways that generate most of 
the miRNAs in Arabidopsis (summarized in Table 1). We found that 
these pathways present several characteristics: 

1. A short base-to-loop pathway which involves the recognition 
of a bulge followed by —15 nt of a lower stem (Mateos et al. 
2010; Song et al. 2010; Werner et al. 2010; this work). Although 
this stem segment might contain bulges, we found that the 
transition region from the single-strand sequences to the lower 
stem is rather sharp, and three paired bases usually mark the 
beginning of the precursor stem. 

2. A long base-to-loop pathway in which the first cut proceeds in 
a similar way to the shorter version, but then three cuts are re- 
quired until the miRNA is released. This mechanism generates 
additional small RNAs from the precursor, although they usu- 
ally accumulate at low levels. 

3. A short loop-to-base mechanism, where the processing ma- 
chinery is guided by an upper stem segment, and two cuts release 
the mature miRNA. The terminal region of these precursors has 
a conserved length (—42 nt) and a small loop. 

4. A long loop-to-base mechanism in which four sequential DCL1 
cuts process the miRNA precursor (Addo-Quaye et al. 2009; 



Bologna et al. 2009). These precursors have a long conserved 
stem segment that can generate other small RNAs (Rajagopalan 
et al. 2006; Axtell et al. 2007) and is essential for the biogenesis 
of the miRNAs (Bologna et al. 2009). 

We found that the complexity of the miRNA processing 
pathways occurs for both ancient and evolutionarily young se- 
quences, and that members of the same family can be processed in 
different ways, suggesting that the structural determinants can 
change or evolve with time. The processing mechanisms are not 
mutually exclusive, as we found that different RNA domains can 
compete for the miRNA processing machinery in one molecule. In 
this process, alternative miRNAs can be generated from the same 
precursor. 

Plants' miRNA precursors are highly variable in comparison 
with their cognate precursors in animals (Bologna et al. 2009; 
Cuperus et al. 2011). The results obtained here provide the expla- 
nation that diverse RNA structures can serve as miRNA precursors 
in plants, as they can be recognized in at least four different ways. It 
has been shown that animal miRNA precursors can be recognized 
from the loop in vitro by Drosha (Han et al. 2006). However, 
processing of animal miRNAs is compartmentalized in the nuclei 
and the cytoplasm, and after the initial cleavage by Drosha, the 
pre-miRNA is transported to the cytoplasm by Exportin-5 (Lund 
et al. 2004). It is tempting to speculate that the structural re- 
quirements for their nuclear export limit the structural diversity of 
the animal precursor and the alternative processing pathways of 
the animal miRNA precursors. 

In principle, the precise RNA structure of a precursor might 
affect the final activity of the miRNA, as it has been shown for the 
asymmetric bulges located in a miRNA/miRNA* duplex (Manavella 
et al. 2012c). AGO complexes containing miRNAs processed from 
these duplexes are more prone to generate secondary siRNAs from 
their targets (Manavella et al. 2012c). It will be interesting to know 
whether the recognition of the different precursor structures by the 
processing machinery also confers specific features to the mature 
miRNAs. 

Methods 

Plant material and growth 

All plant material was from Arabidopsis thaliana ecotype Col-0 and 
the fieryl mutant. Inflorescence tissue and leaf were harvested 
from plants grown in soil in a growth chamber with 16 h of light 
for 5 wk. Seedlings were grown at 23°C under the same 16 h, long- 
day conditions and were harvested after 2 wk. Seeds of fieryl 
(SALK_020882) were obtained from the Arabidopsis Biological 
Resource Center (ABRC). Total RNA was isolated using TRIzol 
reagents (Invitrogen). 



Figure 8. Switching structural determinants in plant miRNA precursors. (A) Scheme of MIR825. Cuts 
releasing the miRNA/miRNA* are indicated by the green arrows. Gray arrows show other less abundant 
cleavage sites. A frequent cut located at 1 5 nt of a lower stem is indicated with an orange arrow, as well 
as an alternative small RNA detected at low frequency. (B) Scheme of MIR319a precursor and mutants 
with a deleted terminal region and the addition of a structured lower stem. Structured regions in the 
lower stems are highlighted with pink boxes. The cleavage sites analyzed by 5' RACE PCR are indicated 
by red and black lines. The length of the arrows and the number of sequenced clones indicate the 
relative cloning frequency of the intermediates. (C) Phenotypes caused by overexpression of the wild- 
type and mutant MIR319 precursors. Note that wild-type leaves are flat but become crinkled when 
miR31 9 is overexpressed. (Inset) Transgenic plants were classified according to the shape of their leaves. 
At least 50 independent transgenic plants were scored in each case. (D) Small RNA blots of transgenic 
Arabidopsis inflorescences expressing the different precursors. Each sample is a pool of 25 independent 
transgenic plants. The miRNAs are indicated in red and the miRNAs* in blue. 



SPARE library 

Total RNA (1 juug) depleted of ribosomal 
RNA was isolated using the RiboMinus 
Plant Kit for RNA-seq (Invitrogen). The 
RNA was ligated to the RNA oligonucleo- 
tide adaptor (5'-GUUCAGAGUUCUACA 
GUCCGAC-3') using T4 RNA ligase (Fer- 
mentas). The ligated products were purified 
and used as a template in 10 multiplex 
reverse transcription reactions using 
— 18 different precursor-specific oligos 
in each reaction (Supplemental Table 1). 
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First-strand cDNA synthesis was carried out using Superscript III 
Reverse Transcriptase (Invitrogen). Each precursor-specific oligo 
also contained a 5' generic adaptor tale with the sequence 5 '-AG 
CAGAAGACGGCATACGA-3 ' . Then, mixes of the 10 multiplex 
reactions were amplified by PCR using the generic P5 primer 5'-AA 
TGATACGGCGACCACCGACAGGTTCAGAGTTCTACAGTCCGA-3' 
and P7 primer 5 ' -CAAGCAGAAGACGGCATACGA-3 ' . The condi- 
tions for PCR were 18 cycles of 94°C for 20 sec, 60°C for 20 sec, and 
72°C for 20 sec. PCR products were gel-purified and subjected to SBS 
sequencing. 

Transgenes 

Mutated versions of the MIR171 and MIR319 precursor were gen- 
erated by PCR, as described previously (Bologna et al. 2009). The 
precursors were expressed from the 35S promoter using the CHF3 
binary vector (Jarvis et al. 1998). See Supplemental Table 4 for a list 
of binary plasmids and the sequences of the mutant precursors 
used in this study. 

RNA expression analysis 

To perform small RNA blots, 4 to 8 u,g of total RNA was resolved on 
17% polyacrylamide gels under denaturing conditions (7 M urea). 
Antisense DNA oligos to miR319 or miR171 were end-labeled as 
probes with [7-3 2P] ATP using T4 polynucleotide kinase (Fermen- 
tas). Hybridizations were performed overnight at 38°C using Per- 
fect Hyb buffer (Sigma). See Supplemental Figure 4 for details about 
the determination of MIR171 precursors by RT-qPCR. 

Cycle RT-PCR cleavage site mapping of miRNA precursors 

Aliquots of 5 u,g of low molecular weight RNA isolated using the 
mirVana miRNA Isolation Kit (Ambion) were self-ligated with T4 
RNA ligase (Fermentas); the reaction volume was 100 (xl, and the 
reaction was performed overnight at 14°C. After the ligation re- 
action, RNAs were precipitated with ethanol, 3M NaAC, and gly- 
cogen, and further resuspended in water. Next, first-strand cDNA 
synthesis was carried out using Superscript III Reverse Transcrip- 
tase (Invitrogen) with MIR172a loop-specific primer 5'-TGAATC 
ACCACCGTCCATCAAC-3'. PCR reactions were performed using 
MIR172A loop-specific primers 5'-TGAATCACCACCGTCCATC 
AAC-3' and 5'- CTCTCCACAAAGTTCTCTATG-3 ' . The PCR prod- 
ucts were resolved on 3% agarose gels and detected by ethidium 
bromide staining, cloned into pBlueScript vector, and sequenced. 

5' RACE cleavage site mapping of miRNA precursors 

For construction of libraries, polyadenylated RNA molecules were 
isolated using the PolyATtract kit (Promega). Ligation of an RNA 
adaptor, reverse transcription, and 5' RACE were performed accord- 
ing to the procedure described by Bologna et al. (2009). Two nested 
vector-specific reverse oligonucleotides (RACE vector-specific oligo 
5 ' -GTGCGCAATGAAACTGATGC-3 ' and RACE vector-specific oligo 
nested 5'- CGAAACCGATGATACGAACG-3 ') were used for 5' RACE 
for plants overexpressing mutated MIR319 precursors. The PCR 
products were resolved on 3% agarose gels, detected by ethidium 
bromide staining, cloned into pBlueScript vector, and sequenced. 

Bioinformatic analysis 

The secondary structure of the precursors was calculated using the 
program MFOLD (Zuker 2003) with default parameters to a tem- 
perature of 3 7° C. The proximal end of the miRNA/miRNA* duplex 
was defined as position +1. We analyzed the secondary structure, 



and we considered positions that were matched as 0, while those 
unpaired were considered as +1, and an average for all the pre- 
cursors was made. In the precursors' schemes, some large bulges 
and secondary hairpins were simplified and/or indicated by a 
straight line. We have constructed and implemented a bioin- 
formatic pipeline using in-house scripts and publicly available data 
from miRBase to assist in the analysis of the deep-sequencing li- 
braries. We have developed a web-tool for the analysis of SPARE 
data using MySQL as a database. Small RNA sequences were obtained 
from Arabidopsis next-gen sequence DB (http://mpss.udel.edu/ 
at_sRNA/index.php; Nakano et al. 2006) and miRBase database 
(http://www.mirbase.org/; Kozomara and Griffiths-Jones 2011). 

Data access 

Deep-sequencing data with the SPARE results are accessible 
through the NCBI Gene Expression Omnibus (GEO; http://www. 
ncbi.nlm.nih.gov/geo) under accession number GSE46429. 
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